NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Discovering Data Structures: Nearest Neighbor Search and Beyond

Salemohamed, Omar; Charlin, Laurent; Garg, Shivam; Sharan, Vatsal; Valiant, Gregory (December 2025, Advances in Neural Information Processing Systems)

Free, publicly-accessible full text available December 1, 2026
Correlated Errors in Large Language Models

Kim, Elliot_Myunghoon; Garg, Avi; Peng, Kenny; Garg, Nikhil (June 2025, International Conference on Machine Learning)

Diversity in training data, architecture, and providers is assumed to mitigate homogeneity in LLMs. However, we lack empirical evidence on whether different LLMs differ \textit{meaningfully}. We conduct a large-scale empirical evaluation on over 350 LLMs overall, using two popular leaderboards and a resume-screening task. We find substantial correlation in model errors---on one leaderboard dataset, models agree 60% of the time when both models err. We identify factors driving model correlation, including shared architectures and providers. Crucially, however, larger and more accurate models have highly correlated errors, even with distinct architectures and providers. Finally, we show the effects of correlation in two downstream tasks: LLM-as-judge evaluation and hiring---the latter reflecting theoretical predictions regarding algorithmic monoculture.
more » « less
Free, publicly-accessible full text available June 18, 2026
Toward Weight Sharing Paradigm for Efficient AI: Training and Inference Serving

https://doi.org/10.1145/3759441.3759447

Behnam, Payman; Khare, Alind; Garg, Dhruv; Tumanov, Alexey (August 2025, ACM SIGOPS Operating Systems Review)

Deep neural networks are increasingly required to operate across diverse hardware platforms, latency constraints, and power budgets, which motivates the need for specialized models for each scenario. However, designing and training a separate model per scenario or serving a large ensemble of models is often impractical. Weight sharing has emerged as a promising paradigm to address this challenge by training a single ''SuperNet'' that subsumes many sub-models (SubNets), and by reusing weights across those SubNets both at training and inference time. This paper provides an abridged survey of our recent advances that leverage weight sharing for efficient AI, covering both training and inference serving. In centralized once-for-all training, Delayed ε-Shrinking (DεS) improves training efficiency by strategically scheduling the introduction of smaller SubNets during training. In a federated fashion, SuperFedNas co-trains a SuperNet across distributed clients and disjoins training and searching, which enables oneshot specialization to many deployment targets at minimal cost. ∇QDARTS integrates quantization into differentiable architecture search, jointly finding neural architectures, weights, and low-precision settings to yield highly efficient models in a single search. For inference serving, SuperServe introduces a weight-shared model with dynamic SubNet routing (SubNetAct) to instantaneously switch among a spectrum of accuracy-latency operating points, coupled with a scheduler (SlackFit) for unpredictable workloads. Finally, SUSHI co-designs model, system, and accelerator to exploit weightshared SuperNets on tinyML devices, caching SubGraphs on FPGA to reduce latency and energy. Together, these works demonstrate that the weight sharing paradigm can dramatically improve the efficiency of both training and inference serving of deep models across a range of scenarios.
more » « less
Free, publicly-accessible full text available August 4, 2026
Spatial Audio Processing with Large Language Model on Wearable Devices

Mishra, Ayushi; Bai, Yang; Narayanasamy, Priyadarshan; Garg, Nakul; Roy, Nirupam (July 2025, International Conference on Machine Learning (ICML))

Integrating spatial context into large language models (LLMs) has the potential to revolutionize human-computer interaction, particularly in wearable devices. In this work, we present a novel system architecture that incorporates spatial speech understanding into LLMs, enabling contextually aware and adaptive applications for wearable technologies. Our approach leverages microstructure-based spatial sensing to extract precise Direction of Arrival (DoA) information using a monaural microphone. To address the lack of existing dataset for microstructure-assisted speech recordings, we synthetically create a dataset called OmniTalk by using the LibriSpeech dataset. This spatial information is fused with linguistic embeddings from OpenAI’s Whisper model, allowing each modality to learn complementary contextual representations. The fused embeddings are aligned with the input space of LLaMA-3.2 3B model and fine-tuned with lightweight adaptation technique LoRA to optimize for on-device processing.
more » « less
Free, publicly-accessible full text available July 13, 2026
Smaug: Modular Augmentation of LLVM for MPC

https://doi.org/10.1109/SP61157.2025.00188

Garg, Radhika; Wang, Xiao (May 2025, IEEE)

Free, publicly-accessible full text available May 12, 2026
Sparse Autoencoders for Hypothesis Generation

Movva, Rajiv; Peng, Kenny; Garg, Nikhil; Kleinberg, Jon; Pierson, Emma (June 2025, International Conference on Machine Learning)

We describe HypotheSAEs, a general method to hypothesize interpretable relationships between text data (e.g., headlines) and a target variable (e.g., clicks). HypotheSAEs has three steps: (1) train a sparse autoencoder on text embeddings to produce interpretable features describing the data distribution, (2) select features that predict the target variable, and (3) generate a natural language interpretation of each feature (e.g., mentions being surprised or shocked) using an LLM. Each interpretation serves as a hypothesis about what predicts the target variable. Compared to baselines, our method better identifies reference hypotheses on synthetic datasets (at least +0.06 in F1) and produces more predictive hypotheses on real datasets (~twice as many significant findings), despite requiring 1-2 orders of magnitude less compute than recent LLM-based methods. HypotheSAEs also produces novel discoveries on two well-studied tasks: explaining partisan differences in Congressional speeches and identifying drivers of engagement with online headlines.
more » « less
Free, publicly-accessible full text available June 18, 2026
Balancing Producer Fairness and Efficiency via Prior-Weighted Rating System Design

https://doi.org/10.1609/icwsm.v19i1.35865

Ma, Thomas; Bernstein, Michael S; Johari, Ramesh; Garg, Nikhil (June 2025, Proceedings of the International AAAI Conference on Web and Social Media)

Online marketplaces use rating systems to promote the discovery of high-quality products. However, these systems also lead to high variance in producers' economic outcomes: a new producer who sells high-quality items, may unluckily receive a low rating early, severely impacting their future popularity. We investigate the design of rating systems that balance the goals of identifying high-quality products (``efficiency'') and minimizing the variance in outcomes of producers of similar quality (individual ``producer fairness'').We show that there is a trade-off between these two goals: rating systems that promote efficiency are necessarily less individually fair to producers. We introduce prior-weighted rating systems as an approach to managing this trade-off. Informally, the system we propose sets a system-wide prior for the quality of an incoming product; subsequently, the system updates that prior to a posterior for each product's quality based on user-generated ratings over time. We show theoretically that in markets where products accrue reviews at an equal rate, the strength of the rating system's prior determines the operating point on the identified trade-off: the stronger the prior, the more the marketplace discounts early ratings data (increasing individual fairness), but the slower the platform is in learning about true item quality (so efficiency suffers). We further analyze this trade-off in a responsive market where customers make decisions based on historical ratings. Through calibrated simulations in 19 different real-world datasets sourced from large online platforms, we show that the choice of prior strength mediates the same efficiency-consistency trade-off in this setting. Overall, we demonstrate that by tuning the prior as a design choice in a prior-weighted rating system, platforms can be intentional about the balance between efficiency and producer fairness.
more » « less
Free, publicly-accessible full text available June 7, 2026
VOccl3D: A Video Benchmark Dataset for 3D Human Pose and Shape Estimation under real Occlusions

Garg, Yash; Bachu, Saketh; Dutta, Arindam; Lal, Rohit; Bose, Sarosij; Ta, Calvin-Khang; Asif, M Salman; Roy-Chowdhury, Amit (August 2025, International Conference on Computer Vision (ICCV))

Free, publicly-accessible full text available August 9, 2026
Sparse Autoencoders for Hypothesis Generation

Movva, Rajiv; Peng, Kenny; Garg, Nikhil; Kleinberg, Jon; Pierson, Emma (May 2025, ICML)

Free, publicly-accessible full text available May 30, 2026
Deciphering Photochemical Reactivity of Maleimides by Ultrafast Spectroscopy: How Minor Pathways Have Major Implications in Photochemical Reactions

https://doi.org/10.1021/acs.jpca.5c01860

Garg, Dipti; Tarnovsky, Alexander N; Sivaguru, Jayaraman (May 2025, The Journal of Physical Chemistry A)

Free, publicly-accessible full text available May 1, 2026

« Prev Next »

Search for: All records